An automatic method for extracting citations from Google Books
نویسندگان
چکیده
Recent studies have shown that counting citations from books can help scholarly impact assessment and that Google Books (GB) is a useful source of such citation counts, despite its lack of a public citation index. Searching GB for citations produces approximate matches, however, and so its raw results need timeconsuming human filtering. In response, this article introduces a method to automatically remove false and irrelevant matches from GB citation searches in addition to introducing refinements to a previous GB manual citation extraction method. The method was evaluated by manual checking of sampled GB results and comparing citations to about 14,500 monographs in the Thomson Reuters Book Citation Index (BKCI) against automatically extracted citations from GB across 24 subject areas. GB citations were 103% to 137% as numerous as BKCI citations in the humanities, except for tourism (72%) and linguistics (91%), 46% to 85% in social sciences, but only 8% to 53% in the sciences. In all cases, however, GB found substantially more citing books than did BKCI, with BKCI's results coming predominantly from journal articles. Moderate correlations between the GB and BKCI citation counts in social sciences and humanities, with most BKCI results coming from journal articles rather than books, suggests that they could measure the different aspects of impact, however.
منابع مشابه
Can the impact of non-Western academic books be measured? An investigation of Google Books and Google Scholar for Malaysia
Citation indicators are increasingly used in book-based disciplines to support peer-review in the evaluation of authors and to gauge the prestige of publishers. However, since global citation databases seem to offer weak coverage of books outside the West, it is not clear whether the influence of non-Western books can be assessed with citations. To investigate this, citations were extracted fro...
متن کاملPatent Citation Analysis with Google1
Citations from patents to scientific publications provide useful evidence about the commercial impact of academic research but automatically searchable databases are needed to exploit this connection for large scale patent citation evaluations. Google covers multiple different international patent office databases but does not index patent citations or allow automatic searches. In response, thi...
متن کاملRule based Autonomous Citation Mining with TIERL
Citations management is an important task in managing digital libraries. Citations provide valuable information e.g., used in evaluating an author's influences or scholarly quality (the impact factor of research journals). But although a reliable and effective autonomous citation management is essential, manual citation management can be extremely costly. Automatic citation mining on the other ...
متن کاملCroatian Medical Journal citation score in Web of Science, Scopus, and Google Scholar.
AIM To analyze the 2007 citation count of articles published by the Croatian Medical Journal in 2005-2006 based on data from the Web of Science, Scopus, and Google Scholar. METHODS Web of Science and Scopus were searched for the articles published in 2005-2006. As all articles returned by Scopus were included in Web of Science, the latter list was the sample for further analysis. Total citati...
متن کاملAlternative Metrics for Book Impact Assessment: Can Choice Reviews be a Useful Source?
This article assesses whether academic reviews in Choice: Current Reviews for Academic Libraries could be systematically used for indicators of scholarly impact, uptake or educational value for scholarly books. Based on 451 Choice book reviews from 2011 across the humanities, social sciences and science, there were significant but low correlations between Choice ratings and citation and non-cit...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- JASIST
دوره 66 شماره
صفحات -
تاریخ انتشار 2015